A Graph RAG implementation using pure vector search with Milvus
Project description
Vector Graph RAG
Graph RAG with pure vector search — no graph database needed.
💡 Encode entities and relations as vectors in Milvus, replace iterative LLM agents with a single reranking pass — achieve state-of-the-art multi-hop retrieval at a fraction of the operational and computational cost.
✨ Features
- No Graph Database Required — Pure vector search with Milvus, no Neo4j or other graph databases needed
- Single-Pass LLM Reranking — One LLM call to rerank, no iterative agent loops (unlike IRCoT or multi-step reflection)
- Knowledge-Intensive Friendly — Optimized for domains with dense factual content: legal, finance, medical, literature, etc.
- Zero Configuration — Uses Milvus Lite by default, works out of the box with a single file
- Multi-hop Reasoning — Subgraph expansion enables complex multi-hop question answering
- State-of-the-Art Performance — 87.8% avg Recall@5 on multi-hop QA benchmarks, outperforming HippoRAG
📦 Installation
pip install vector-graph-rag
# or
uv add vector-graph-rag
With document loaders (PDF, DOCX, web pages)
pip install "vector-graph-rag[loaders]"
# or
uv add "vector-graph-rag[loaders]"
🚀 Quick Start
from vector_graph_rag import VectorGraphRAG
rag = VectorGraphRAG() # reads OPENAI_API_KEY from environment
rag.add_texts([
"Albert Einstein developed the theory of relativity.",
"The theory of relativity revolutionized our understanding of space and time.",
])
result = rag.query("What did Einstein develop?")
print(result.answer)
Note: Set
OPENAI_API_KEYenvironment variable before running.
📄 With pre-extracted triplets — click to expand
Skip LLM extraction if you already have knowledge graph triplets:
rag.add_documents_with_triplets([
{
"passage": "Einstein developed relativity at Princeton.",
"triplets": [
["Einstein", "developed", "relativity"],
["Einstein", "worked at", "Princeton"],
],
},
])
🌐 Import from URLs and files — click to expand
from vector_graph_rag import VectorGraphRAG
from vector_graph_rag.loaders import DocumentImporter
# Import from URLs, PDFs, DOCX, etc. (with automatic chunking)
importer = DocumentImporter(chunk_size=1000, chunk_overlap=200)
result = importer.import_sources([
"https://en.wikipedia.org/wiki/Albert_Einstein",
"/path/to/document.pdf",
"/path/to/report.docx",
])
rag = VectorGraphRAG(milvus_uri="./my_graph.db")
rag.add_documents(result.documents, extract_triplets=True)
result = rag.query("What did Einstein discover?")
print(result.answer)
⚙️ Custom configuration — click to expand
rag = VectorGraphRAG(
milvus_uri="./my_data.db", # or remote Milvus / Zilliz Cloud
llm_model="gpt-4o",
embedding_model="text-embedding-3-large",
collection_prefix="my_project", # isolate multiple datasets
)
All settings can also be configured via environment variables with VGRAG_ prefix or a .env file:
VGRAG_LLM_MODEL=gpt-4o
VGRAG_EMBEDDING_MODEL=text-embedding-3-large
VGRAG_MILVUS_URI=http://localhost:19530
📖 Full Python API reference → Python API docs
🔬 How It Works
Indexing:
Documents → Triplet Extraction (LLM) → Entities + Relations → Embedding → Milvus
Query:
Question → Entity Extraction → Vector Search → Subgraph Expansion → LLM Reranking → Answer
Example: "What did Einstein develop?"
- Extract entity:
Einstein - Vector search finds similar entities and relations in Milvus
- Subgraph expansion collects neighboring relations
- Single-pass LLM reranking selects the most relevant passages
- Generate answer from selected passages
📖 Detailed pipeline walkthrough with diagrams → How It Works · Design Philosophy
📊 Evaluation Results
Evaluated on three multi-hop QA benchmarks (Recall@5):
| Method | MuSiQue | HotpotQA | 2WikiMultiHopQA | Average |
|---|---|---|---|---|
| Naive RAG | 55.6% | 90.8% | 73.7% | 73.4% |
| IRCoT + HippoRAG¹ | 57.6% | 83.0% | 93.9% | 78.2% |
| HippoRAG 2² | 74.7% | 96.3% | 90.4% | 87.1% |
| Vector Graph RAG | 73.0% | 96.3% | 94.1% | 87.8% |
¹ HippoRAG (NeurIPS 2024) ² HippoRAG 2 (2025)
📖 Detailed analysis and reproduction steps → Evaluation
🗄️ Milvus Backend
Just change milvus_uri to switch between deployment modes:
Milvus Lite (default) — zero config, single-process, data stored in a local file. Great for prototyping and small datasets:
rag = VectorGraphRAG(milvus_uri="./my_graph.db") # just works
⭐ Zilliz Cloud — fully managed, free tier available — sign up 👇:
rag = VectorGraphRAG(
milvus_uri="https://in03-xxx.api.gcp-us-west1.zillizcloud.com",
milvus_token="your-api-key",
)
⭐ Sign up for a free Zilliz Cloud cluster
You can sign up on Zilliz Cloud to get a free cluster and API key.
Self-hosted Milvus Server (Docker) — for advanced users
If you need a dedicated Milvus instance for multi-user or team environments, you can deploy Milvus standalone with Docker Compose. This requires Docker and some infrastructure knowledge. See the official installation guide for detailed steps.
rag = VectorGraphRAG(milvus_uri="http://localhost:19530")
🖥️ Frontend & REST API
Vector Graph RAG includes a React-based frontend for interactive graph visualization and a FastAPI backend.
# Backend
uv sync --extra api
uv run uvicorn vector_graph_rag.api.app:app --host 0.0.0.0 --port 8000
# Frontend
cd frontend && npm install && npm run dev
| Endpoint | Method | Description |
|---|---|---|
/api/health |
GET | Health check |
/api/graphs |
GET | List available graphs |
/api/graph/{name}/stats |
GET | Get graph statistics |
/api/query |
POST | Query the knowledge graph |
/api/documents |
POST | Add documents |
/api/import |
POST | Import from URLs/paths |
/api/upload |
POST | Upload files |
See API docs at http://localhost:8000/docs after starting the server.
📖 Full endpoint reference → REST API docs · Frontend guide
📚 Links
- Documentation — full guides, API reference, and architecture details
- How It Works — pipeline walkthrough with diagrams
- Design Philosophy — why pure vector search, no graph DB
- Milvus — the vector database powering Vector Graph RAG
- FAQ — common questions and troubleshooting
Contributing
Bug reports, feature requests, and pull requests are welcome! For questions and discussions, join us on Discord.
📄 License
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file vector_graph_rag-0.1.3.tar.gz.
File metadata
- Download URL: vector_graph_rag-0.1.3.tar.gz
- Upload date:
- Size: 31.2 MB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
2dd933b7a00398b9ee6be229e6e7b5a48f7f597d0e84d94e81c521de5a81a8ed
|
|
| MD5 |
471b60190b7bedf3125e05a8145a8fe5
|
|
| BLAKE2b-256 |
1d6a2edb476fbc014168793263d4766d93fc53a1175a8fc2cade5423c4aaa994
|
Provenance
The following attestation bundles were made for vector_graph_rag-0.1.3.tar.gz:
Publisher:
release.yml on zilliztech/vector-graph-rag
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
vector_graph_rag-0.1.3.tar.gz -
Subject digest:
2dd933b7a00398b9ee6be229e6e7b5a48f7f597d0e84d94e81c521de5a81a8ed - Sigstore transparency entry: 1252391178
- Sigstore integration time:
-
Permalink:
zilliztech/vector-graph-rag@b56a3961489c09f7d5b3b4c8b0686184efaa40fd -
Branch / Tag:
refs/tags/v0.1.3 - Owner: https://github.com/zilliztech
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@b56a3961489c09f7d5b3b4c8b0686184efaa40fd -
Trigger Event:
push
-
Statement type:
File details
Details for the file vector_graph_rag-0.1.3-py3-none-any.whl.
File metadata
- Download URL: vector_graph_rag-0.1.3-py3-none-any.whl
- Upload date:
- Size: 65.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
fd049ef7e791d96d5127111311f801c83b36aaf5a832bcc0ceeace6375c86736
|
|
| MD5 |
c1b1b853508b37c7dcab14dc2c9c2cb5
|
|
| BLAKE2b-256 |
f1b086fa4f5a831c9035216503674da4506afdf972af9f23e6b6f5c7bd4b1801
|
Provenance
The following attestation bundles were made for vector_graph_rag-0.1.3-py3-none-any.whl:
Publisher:
release.yml on zilliztech/vector-graph-rag
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
vector_graph_rag-0.1.3-py3-none-any.whl -
Subject digest:
fd049ef7e791d96d5127111311f801c83b36aaf5a832bcc0ceeace6375c86736 - Sigstore transparency entry: 1252391186
- Sigstore integration time:
-
Permalink:
zilliztech/vector-graph-rag@b56a3961489c09f7d5b3b4c8b0686184efaa40fd -
Branch / Tag:
refs/tags/v0.1.3 - Owner: https://github.com/zilliztech
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@b56a3961489c09f7d5b3b4c8b0686184efaa40fd -
Trigger Event:
push
-
Statement type: